Core organization of directed complex networks 
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The recursive removal of leaves (dead end vertices) and their neighbors from an undirected 
network results, when this pruning algorithm stops, in a so-called core of the network. This 
specific subgraph should be distinguished from fc-cores, which are principally difi'erent subgraphs 
in networks. If the vertex mean degree of a network is sufficiently large, the core is a giant cluster 
containing a finite fraction of vertices. We find that generalization of this pruning algorithm to 
directed networks provides a significantly more complex picture of cores. By implementing a rate 
equation approach to this pruning procedure for directed uncorrelated networks, we identify a set 
of cores progressively embedded into each other in a network and describe their birth points and 
structure. 
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I. INTRODUCTION 

In terms of connectivity, a network, generally, is a set 
of connected components, which may include a giant con- 
nected component containing a finite fraction of vertices 
and edges (which is well-defined in infinite networks) lj~ 
13]. Studies of random graphs, starting from the first 
works Q , essentially focused on the connected compo- 
nents, their statistics and structure [7]. A natural gen- 
eralization of connected components are fc-cores, which 
are obtained by a special pruning procedure, namely the 
recursive removal of vertices with degrees smaller than 
fc jl, 1^. The fc-core structure of complex networks is 
being studied extensively [HI- On the other hand, 
other key subgraphs of networks, so-called cores, are far 
less studied. For an undirected network, the core is a 
final result of the application of the following pruning 
algorithm. Remove recursively the leaves (by definition, 
vertices of degree 1) with their nearest neighbors from 
the network. Speaking in more detail, at each step, re- 
move a randomly chosen leaf and its neighboring vertex 
from the network. Continue the process until no leaves 
remain. The resulting subgraph consists of finite com- 
ponents and, if the mean vertex degree (g) exceeds some 
threshold, of a giant component containing a finite frac- 
tion of vertices. It is this giant component that is usually 
called the core of a network The cores are related 
to a wide range of topical problems for networks such 
as controllab ility of networks [l^, localization on ran- 
dom graphs [14( 1 ■ and some combinatorial optimization 
problems like maximum matching jl5l . [l6j and minimum 
vertex cover 

As a first application, this pruning algorithm was used 
by Karp and Sipser who intended to find a maximum 
matching in graphs [Tsj (at least, some of the maximum 
matchings), where a matching is a set of edges of a graph 
which have no common vertices. They proposed an al- 
gorithm which recursively selects an edge (i, j) for which 
vertex i has degree 1. This edge is included into the 



matching. Then all the edges incident to vertex j, in- 
cluding the edge (i,j) are removed from the graph, and 
the process is recursively repeated. If the resulting graph 
has no leaves (e.g., when the graph is already reduced to 
the core) choose an edge at random, include it into the 
matching and remove all the edges incident to the end 
vertices of this edge. These steps are repeated until all 
the edges will be removed. The result of this procedure 
was suggested to be a maximum matching. Although it 
was shown that the Karp-Sipser algorithm, in general, 
actually does not generate the maximum matchings |20| 
it is often used as a preproces sing step for other maxi- 
mum matching algorithms (2TI . [22j |. 

For tree-like graphs, after the pruning, a remaining 
subgraph may consist only of isolated vertices and a giant 
core. Bauer and Golinelli studied the leaf removal algo- 
rithm on the Erdos-Rcnyi graphs |I2:] , which are tree-like 
in the infinite network limit. Using generating functions, 
they showed that there is a continuous phase transition at 
the mean degree (g) = e, where e = 2.718..., so that be- 
low the transition the remaining subgraph contains only 
isolated vertices and above the transition this subgraph 
consists of isolated vertices and a giant core. 

Previous studies of cores were mostly focused on undi- 
rected networks, while numerous real- world networks, 
similarly to the World Wide Web, are directed graphs 
(digraphs), i.e., their edges are directed. A leaf in di- 
rected networks is defined as a vertex of in- or out-degree 
equal to 1, which gives in- or out-leaves respectively. The 
result of the recursive removal of these leaves (the algo- 
rithm is explained below in detail) is the core of a di- 
rected network [l^. Importantly, there were found close 
relations between the intensively studied controllability 
of directed networks and the emergence of a core and 
the problem of maximum matching for directed graphs 
[l3| . These findings stimulated particular interest in the 
cores in directed networks. Recently, Liu et al. presented 
analytically derived results for cores in directed random 
graphs [23j. They defined however the core in a different. 
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peculiar way. They transformed a directed network into 
a bipartite graph and then found the core of this graph as 
if it was undirected unipartite. This convenient approach 
crucially simplifies the task and allows for a straightfor- 
ward generating function technique but it actually leaves 
unsolved the original problem of the core structure of 
directed networks as it was defined in Ref. [Tsj . 

In this paper, we adapt the leaf removal algorithm to 
the case of digraphs. This extended leaf removal algo- 
rithm is introduced in such a way that it fits the defi- 
nition of matching patterns on digraphs. In digraphs, a 
matching is a subset of edges such that no two edges in 
the set share a common starting vertex or a common end- 
ing vertex [2l|. Similarly to the Karp-Sipser algorithm, 
used for constructing of a matching on undirected graphs, 
in the algorithm for digraphs we choose recursively one 
directed edge in which vertex i is leaf. This edge 

is included in the matching and removed from the graph 
together with outgoing edges of vertex j, if vertex i is 
in-leaf, or together with incoming edges of vertex j, if 
vertex i is out-leaf. This allows one avoiding forbidden 
configurations in the matching. 

The simplicity of the leaf removal algorithm for di- 
rected networks enables us to describe the evolution of 
the structure of the network during the pruning pro- 
cess by applying rate equations. Similar, though signif- 
icantly more compact, rate equations have been derived 
for the description of the leaf removal algorithm on undi- 
rected graphs, while constructing minimum vertex covers 
[25} . Using these rate equations we reobtain the size 
of a core in the configuration model of undirected random 
networks and the number of links in this core. For the 
Erdos-Renyi undirected random graphs, we compare our 
results for cores with those obtained by the generating 
function technique fl^l and demonstrate the preciseness 
of the rate equation approach. In this work we apply 
the extended leaf removal algorithm to the configuration 
model of directed uncorrelated networks with a given de- 
gree distribution and derive rate equations which allow 
us to find the structure of the cores in directed networks. 
For directed networks with Poisson and power-law de- 
gree distributions we compare our results with simula- 
tions in Ref. [l3j and observe a complete agreement. In- 
and out-leaves in directed networks can be classified ac- 
cording to the number m of their outgoing and incoming 
edges, respectively. This enables us to naturally decom- 
pose a directed network, layer by layer, removing leaves 
with higher and higher m, to sub-cores; we name them 
m-cores. Note that this decomposition of the directed 
networks differs principally from the well-known fc-core 
decomposition of networks Q . 

This paper is organized as follows. In the next section, 
for the sake of clarity, we first apply the rate equation 
approach for leaf removal [H, to the core extrac- 
tion problem for undirected uncorrelated networks and 
demonstrate the preciseness of this approach. In Sec. IIIII 
we introduce in detail the extended leaf removal algo- 
rithm for directed networks and derive the rate equations. 



By using these equations we find the basic characteristics 
of cores in diverse directed uncorrelated networks and ex- 
plore the m-core decomposition of directed networks. 



II. UNDIRECTED NETWORKS 
A. Rate equations 

Let us first consider an undirected uncorrelated net- 
work with N vertices, L edges and a degree distribution 
P(q), w hich is described here by the configuration model 
[2^ [231, i.e., a labelled random graph with a given degree 
sequence. We apply the leaf removal algorithm to 
this network. According to this algorithm, at each step 
we choose at random a leaf and remove it together with 
its nearest neighbor vertex and all the incident edges to 
this neighbor. The procedure is iterated until no leaves 
remain in the network. 

The structural evolution of the network during the leaf 
removal process is described by so-called rate equations 
for the degree distribution of the remaining network fl^, 
[25| . For the sake of comparison with directed networks, 
let us remind its meaning (for more detail, see the original 
derivation in Refs. [l^, [23). Let N{q,t) be the average 
number of vertices with degree q and N{t) be the total 
number of remaining vertices at rescaled time t = T /N , 
where T is the number of algorithmic steps. So At = 1 /N 
as a rescaled time of one iteration. The change of the 
average number of vertices with degree q after one step 
is described by the following expression: 

N{q,t + M)=N{q,t)~5q^^ 

qP{q,t) ^ {q{q-l))t{q + l)P{q+l,t)-qPiq,t) 
{<l)t {q)t {q)t 

The delta function in the right-hand side shows that at 
each step one leaf is removed. In an uncorrelated net- 
work, the degree distribution of the end vertices of a uni- 
formly randomly chosen edge is P'^^\q,t) = qP{q)/{q). 
So when we remove the nearest neighbor of the leaf, the 
number of vertices with degree q, is decreased by 1 with 
probability P^^\q,t). After the removal of the leaf and 
its nearest neighbor with their edges, the number of con- 
nections of the second-nearest neighbors of the leaf di- 
minishes. So finally the last term describes the change 
of N{q) due to the decrease by 1 of the degrees of the 
second-nearest neighbors of the removed leaf. The num- 
ber of the second neighbors, in average is equal to the 
mean degree of the nearest neighbor vertex except one 
(connection to the leaf), that is, ^^iq — l)P^^^q,t) — 
{q{q—l))t/{q)t- Note that the last term on the right-hand 
side of Eq. ([T|) can be derived strictly implementing a sin- 
gle assumption, namely, the absence correlations in the 
network during the pruning process. 

At each step, the total number of the vertices in the 
network decreases by 2. The number of removed edges, in 
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FIG. 1. Different regimes of the evolution of the fraction of leaves P(l) and the corresponding mean degree generated by the 
leaf removal algorithm applied to the Erdos-Renyi undirected graph with different mean degree cq: (a) and (b) co = 2.5 is 
below the critical point, (c) and (d) co takes the critical value equal e ~ 2.72, (e) and (f) co — 3.0 is above the critical value. 



average is equal to the mean degree of the first neighbor 
of the leaf. This results in the following equations: 
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{<l)t • 



(2) 



(3) 



Substituting N{q,t) = P{q,t)N{t) into Eq. (P gives the 
following evolution equation for the degree distribution 

El Hi: 



il~2t)Piq,t) = 2P{q,t)^S. 



(g(g-i))^ 



{q+l)P{q+l,t). (4) 



Eq. (|4]) is actually a set of differential equations, describ- 
ing the evolution of a network during pruning. These 
equations are seemingly similar to the rate equations de- 
scribing the evolution of networks driven by the preferen- 
tial attachment mechanism [2^, [2^ . The key difference 
however is that Eq. (jlj is nonlinear due to the moments 
of the degree distribution P{q,t) which enter the right- 
hand side of the equation. This nonlinearity makes the 
solution of the equation particularly difficult. Notably, 
while deriving, it was assumed that the network remains 
uncorrelated during this pruning process. This is actually 
a strong assumption, which was never specially proved. 
For example, in a well-studied problem of connected com- 
ponents in uncorrelated networks, the giant component 
generally have degree-degree correlations 30]. This is 
why it is interesting to compare results for cores obtained 



by using the rate equations with those found by other 
methods. 

The set of equations (H]) was treated analytically only 
while studying vertex covers for the Erdos-Renyi ran- 
dom graphs f25i] . The analysis was based on the following 
ansatz. It was assumed that the degree distribution stays 
Poissonian for q > 1 during the whole leaf removal pro- 
cess (though mean degree varies). In the present paper 
we use a different approach, without making this assump- 
tion. We directly solve numerically the set of these equa- 
tions (0 < 5 < (?max) and obtain the structure of the core 
analyzing the evolution of the degree distribution during 
the pruning process. Note that we consider infinite net- 
works which may have vertices with degrees approaching 
infinity. In this situation we have to truncate the set of 
evolution equations. 



B. Cores in undirected networks 

We apply the leaf removal algorithm to an undirected 
uncorrelated network with degree distribution P{q, t ^ 0) 
and a vertex mean degree equal to cq, as initial condi- 
tions. Algorithm is iterated until no vertices of degree 
1 remain. The degree distribution of the network is ob- 
tained at each time step by solving Eq. We fo- 
cus on the evolution of the relative number of leaves, 
P{l,t). Changing the initial mean degree cq, we observe 
two different behaviors for P{l,t) separated by a crit- 
ical point, see Fig. [T] Below the critical point, P{l,t) 
becomes zero exactly in the moment in which the ver- 
tex mean degree in the remaining networks reaches zero. 
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FIG. 2. (a) Relative size ric — Nc/N and (b) the normalized 
number of edges Ic — Lc/N in a core for different undirected 
networks versus the vertex mean degree of an original net- 
work. The networks are the Erdos-Renyi random graph (ER) , 
the scale-free networks (SF) with the degree distribution ex- 
ponents 7 = 4, 3.6, 3.4, and the network with an exponentially 
decaying degree distribution, (c) and (d) The comparison of 
our results (dots) for the Erdos-Renyi undirected graphs with 
the corresponding results obtained by the generating function 
technique (solid lines) 



This means that the algorithm finishes when the net- 
work disappears. Contrastingly, above the critical point, 
P(l,i) becomes zero at a time t* at which the mean de- 
gree is still nonzero. That is, at this moment, there still 
remains a subgraph with vertices of degrees greater than 
1, i.e., the core. The number of remaining vertices except 



isolated vertices at time t* , is the size of the core: 

Na = N{t*) ~ N{0, t*) = N{t*) - N{t*)P{0, t*) 

= iV(l - 2r)[l - P(0,t*)]. (5) 

The number of edges in the core is obtained from Eq. ([3]) 
into which we substitute the solution P{q,t) of Eq. ([5]), 
namely, 



Lit*). 



(6) 



To demonstrate this approach, we apply it to a few 
undirected uncorrelated networks. First, consider the 
Erdos-Renyi random graphs, which have a Poisson de- 
gree distribution, P{q) = e~'^°CQ/q\, in which cq = 2L/N 
is the mean vertex degree for the graph. The rapid de- 
cay of the Poisson distribution allows us to reduce greatly 
the set of the evolution equations which we solve numer- 
ically. For the mean degrees cq in the range between 
1 and 8, it is sufficient to solve the set of the first 21 
equations, Qmax = 20. Figure [T] demonstrates three dif- 
ferent regimes of the leaf removal process. Note that at 
the critical point, cq ~ 2.72 e, the curve P(l,i) is 
tangent to the t axis at t* , see Fig. 1(c). In Fig. [2] we 
show the relative size of a core, Uc = Nc/N and the nor- 
malized number of edges in a core, Ic = Lc/N for the 
Erdos-Renyi undirected graphs. The points on the plots 
are obtained in the following way: for each considered 
value of initial mean degree of the network Cq, we find 
the time t* at which the leaf removal process is com- 
pleted and the evolution of the distribution P{q,t < t*) 
and so of its moments, and then the size and the number 
of edges are calculated from Eqs. dS]), (O, and ([6|). Our 
approximate value for critical point is cq ~ 2.72, which 
agrees with Refs. [13, HI]- Moreover, the comparison of 
the dependencies of the size and the number of edges in 
the core on the vertex mean degree cq found by the nu- 
merical solution of the rate equations and those obtained 
by using generating functions reveals a complete agree- 
ment between these two approaches, see Fig. ^c) and 
(d). This suggests that the evolution equations, which 
were used, are actually asymptotically exact, similarly 
to the approach of Ref. [12]. 

We also considered uncorrelated networks with an ex- 
ponential degree distribution P{q) = (1 — e~^/")e~'/", 
where cq — e^^^'^/{l — e^/''). The critical point is at 
Co « 4.0 as is shown in Fig. [2l In the case of scale-free 
networks P{q) oc q~'^, Uc and Ic depend on both the 7 
exponent and the mean degree cq. We consider asymp- 
totically scale-free networks with the degree distribution 



_ r co(7-2) 

2(7-1) 



ir(g-7 + i,i^)/r(<z + i) = g- 



where T{s) is the gamma function and T{s,x) is the up- 
per incomplete gamma function [31|. This degree dis- 
tribution is typical in the static model [32i] . which is 
extensively used in simulations. This is why this form 
will allow for comparison with results obtained in nu- 
merical simulations. Here we only explore the region 
7 > 3. To obtain the dependencies nc(co) and lc{co) 
for scale-free networks shown in Fig. [2Ia,b) we solve Eq. 
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FIG. 3. Various out- and in-leaves in directed networks. For 
out- and in-leaves, the number m of, respectively, incoming 
and outgoing edges is arbitrary, m — 0,1, 2, .... In the special 
case of qin ~ Qout = 1, the vertex is simultaneously an in- and 
out-leaf. 



dl]) with the cutoff gmax = 100, ignoring the higher de- 
grees. These dependencies agree with those obtained for 
cores of these networks within the generating function 
approach in Ref. [23| . 



III. DIRECTED NETWORKS 

In directed networks, each vertex is characterized by 
its in-degree qm and out-degree Qout, and the joint in, out- 
degree distribution of a network is P{qin,(lout)- In the 
directed networks, leaves are defined as vertices with 
Qin = 1 or Qout = 1, see Fig. [31 In general, removing 
of all leaves leads to a giant core above a critical point. 
Importantly, the leaf removal algorithm for directed net- 
works should be formulated in such a way that this algo- 
rithm will be applicable for combinatorial optimization 
problems on directed graphs, like the matching problem. 



A. Leaf removal algorithm: rate equations 

Let us generalize the leaf removal algorithm to directed 
networks. By definition, the leaves here are vertices with 
Qin — 1 and an arbitrary number (including zero) of out- 
going edges (in-leaf) or with qout = 1 and an arbitrary 
number (including zero) of incoming edges (out-leaf). In 
the particular case qtn = qout = 1, the vertex is simul- 
taneously an in- and out-leaf, but this coincidence does 
not affect the leaf removal process. The extended leaf re- 
moval algorithm is defined in the following way. At each 
step, one out-leaf or one in-leaf is uniformly randomly 
chosen with probability 1/2. The single outgoing edge or 
incoming of this out- or, respectively, in-leaf is removed, 
so that this vertex becomes a "normal" (not a leaf) vertex 
(notice that we do not delete this vertex because it may 
still have connections with other its neighbors). Next, we 
focus on the neighbor vertex which was connected with 
the leaf by that removed edge. If the leaf was an out- 
leaf, we also remove all its neighbor's incoming edges; if 
that leaf was an in-leaf, we remove all its neighbor's out- 
going edges, since they are not allowed to enter in the 



FIG. 4. Leaf removal process on directed networks. The 
open dots show out- (see left) and in- (see right) leaves. Once 
a leaf is selected, the dashed edges are removed, and the leaf 
becomes a normal vertex. 



corresponding matching pattern. Figure |4] explains this 
process. This procedure is iterated until no leaves are re- 
mained. Importantly, in contrast to undirected networks, 
during this process, the number of vertices in the network 
is conserved. The leaves, being removed, are transformed 
into normal vertices. Interestingly, in this procedure, a 
vertex may change its status several times. For exam- 
ple, the following evolution of a vertex is possible. In the 
original network the vertex is normal, then it becomes an 
in-leaf, afterwards it is transformed again into a normal 
vertex, later it becomes an out-leaf, and finally it is an 
isolated vertex. 

Another marked difference from undirected networks 
is that in directed networks leaves can have various num- 
bers of edges, see Fig. [31 We propose to distinguish 
and classify leaves in directed networks by the number 
of their connections. In our leave removal algorithm we 
recursively remove leaves having qout — 1 , qin < m or 
qin = Ij^ojit < fn, which results in the specific subgraph 
which we name m-core. Here m is a given non- negative 
integer. If m equals the highest vertex in, out-degree in 
the graph, i.e., max {qin, qout), the algorithm surely re- 
moves all possible leaves from a network, which finally 
results in the main (or central) core or, as we call it here, 
simply, the core. 

Let us derive a master equation, describing the evolu- 
tion of the structure of an uncorrelated directed network 
during this algorithm. Let N (qin, qout, t) be the aver- 
age number of vertices with in-degree qin and out-degree 
qout at time t. As a consequence of the removal of edges, 
N (qin, qout, t) is changed after each iteration. The dy- 
namics can be decomposed into three steps. 

(1) Choosing a leaf. We choose uniformly at random 
vertices with degrees qout = ^,<lin ^ "m- (out-leaves) or 
qin = 1, qout < rn (in-leaves). At each step we choose one 
leaf. Suppose that we selected an out-leaf (this choice 
is made with probability 1/2). The probability that 
the in-degree of this leaf turns out to be qin, is equal 
to e{m - 9in)f (gm,l)/Eq„.<m^fe"'l)]- ^ere e{i) is 
defined for integers: 9{i>0) ~ 1 and 9{i < 0) = 0. 
We must remove the outgoing edge of this leaf, and 
so with this probability the number of vertices with 
qout = 1 , qin < TTt decreases by 1 while the number of ver- 
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tices with qout = 0,qin < rn increases by 1. In the expres- 
sion for the difference N{qin, qout, t + At) - N{qin,qout,i) 
[see Eq. ([7]) befow], this gives the first two terms on the 
right-hand side. Similar arguments are vahd if the se- 
lected leaf is in-leaf. Two corresponding terms can be 
seen on the right-hand side of Eq. Q . 

(2) Changing the degree of that leaf's nearest neigh- 
bor which is connected to the leaf by a single incom- 
ing or outgoing edge. The probability that this outgoing 
edge of the out-leaf at its second end has a vertex of 

degree qi„,qout is Poutiqin,qout) = qinP{qin,qout) / {qin) ■ 
Therefore with this probability the number N{qin,qout) 
is decreased by one, which gives the third term on the 
right-hand side of Eq. ([7]) . Recall that we remove all in- 
coming edges to this vertex, and so its in-degree changes 
to qin = 0. Consequently, the number of vertices with 
qin = increases by Sq,^^oJ2qi„<linP{qin, qout) /{qin), 



which is the fourth term on the right-hand side of Eq. (O . 
A similar argument is valid when we select an in-leaf. 

(3) Changing the degree of leaf's second neighbors. 
When we select an out-leaf and remove its outgoing edge, 
we also remove all incoming connections from the leaf's 
neighbor on this edge, see Fig. H) Consequently, the out- 
degrees of the corresponding leaf's second neighbors de- 
crease by 1. The number of these incoming edges except 
the edge attached to the leaf, in average is equal to the 
mean in-degree of the nearest neighbor of the leaf mi- 
nus 1, i.e., J2q,r,.qa.u ~ i)Pini9in,qout)- This leads to 
the fifth term on the right-hand side of Eq. ([7]). Similar 
arguments are valid when we select an in-leaf. 

Applying all these arguments we finally obtain the evo- 
lution equation for the degree distribution P{qin,qout, t)- 



N {qin, qout, t + At) - N {qin, qout, t) ^ P {qin, qout, t) 

_ 1 / _ ^go„f - qin)P{qin, 1, t) 5q^^^fi6{m - qtn)P{qin,0, t) _ qin P {qin, qout, t) 
"2!^ Y.q^^<mP{q^n,l,t) E,„<™^fen,0,i) {q,n)t 

^ ^q,r.,0 J2qi„ qinP{qin, qout, t) ^ {qin{qin - (gout + l)P{qin,qout + l,t) - qout P {qin , qout , t) \ 

{qin)t {qin)t {qout)t J 

_l_ 1 I _ Sq^^^id{m - qout)P{l, qout,t) _^ 5q.^fi6{'m - qout)P{0, qout,t) _ qout P {qin, qout, t) 
2\ T,q„^,<mP{'^,qout,t) Y.q,^,<mP{^,qout,t) {qout)t 

^ '^go„t,oZ]g„„t qoutP {qin, gout, t) ^ {qout{qout - l))f {qin + i)P{qin + I, qout, t) - qm P {qin , qout , t) 

{qout)t {qout)t {qin)t 



(7) 



We used the fact that, no vertices are removed in this pro- 
cess in contrast to undirected networks, i.e., N{t + At) = 
N{t) = N. The time step is At = 1/N, and in the 
large network limit we passed from the discrete differ- 
ence to the time derivative of the degree distribution 
P{qin,qout,t) = N{qin,qout,t)/N. Equation Q is ac- 
tually an infinite set of nonlinear differential equations, 
even more difficult to solve than Eq. (jlj for undirected 
networks. 

It turns out possible to express the time derivative 
of total number of edges of the network in terms of 
the moments of the degree distribution. If we choose 
an out-leaf, we remove all incoming edges of the leaf's 
nearest neighbor (this neighbor is at the second end of 
the outgoing edge of the leaf). The number of these 
edges in average is equal to the mean in-degree of the 
leaf's nearest neighbor vertex. The probability that we 
reach a vertex of degree , qout , following an outgoing 

edge of the leaf is Pout{qtn,qout) = qinP{qin,qout) / {q^) ■ 
Therefore, the mean in-degree of the leaf's neighbor 

is J2q,„,q,^^HnPoutiqin,qout) = {qfn)t/{qin)t- If au iu- 



degree leaf is chosen, the number of edges which are re- 
moved at one step in average is equal to the mean out- 
degree of the leaf's nearest neighbor, which is, similarly, 
{qout) t/ {qout) t- Choosing out-degree or in-degree leaves 
with equal probability (1/2) leads to the following result: 

m _ if {qfn)t {qLt)t \ .g^ 
N 2\{q,n)t {qout)t)' ^ ' 

Our numerical results for cores in directed networks are 
based on Eqs. ([7]) and pO)) . One may suggest that they 
are asymptotically exact, similarly to Eqs. ©-(HI) for 
undirected uncorrelated networks which we checked, see 
Figs.Hcd). 

B. m-core structures 

As was explained, we can run the algorithm for dif- 
ferent values of m and so arrive at different m-cores. 
The smallest number m is 0. In this case, only leaves 
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FIG. 5. (a) Relative size and (b) the normalized number 
of edges of a core versus co (the mean in- or out-degree of a 
vertex in a network) for different directed networks: the di- 
rected Erdos-Renyi graph (ER), the scale- free networks (SF) 
with the degree distribution exponents 7 = 4, 3.6, 3.4, and the 
network with an exponentially decaying degree distribution. 



with q^n = l,qout = and = O.qout = 1 are ran- 
domly chosen. The leaf removal algorithm is applied to 
the network until none of these leaves remain. The re- 
maining subgraph is 0-core. If we take m = 1, we re- 
cursively remove the leaves with qin = 1, qout = 0,1 and 
Qin = 0, 1, qout = 1- In this case, we obtain a smaller sub- 
graph within the 0-core, namely the 1-core. Similarly, re- 
cursively removing leaves with degrees qin = 1 , qout < m 
and qin < qout = 1 we arrive at the m-core (should 
not be confused with ordinary /c-cores). In this way we 
decompose a directed network into an onion-like struc- 
ture of m-cores. Although this m-core decomposition of 
the directed networks differs principally from the well- 
studied fc-core decomposition, they have something in 
common. In particular, vertices belonging to inner cores, 
have higher connectivity. This is why the position of a 
vertex in the system of m-cores may be a better connec- 
tivity metrics than the vertex degree which is only a local 
characteristic. 

To find the structure of the m-core of a directed net- 
work, we study the evolution the degree distribution of 
leaves during the leaf removal process. In particular, 
we must find when all following probabilities become 
zero: P(0,1,C) = ^^(1,1,0 = ^(2,1,CJ = - - 
P(m,l,0 = P(l,0,O = P(1,1,C) = ^(l,2,t;j = 



FIG. 6. m-core decomposition for the directed Erdos-Renyi 
graph, (a) The relative sizes and (6) the normalized numbers 
of edges of m-cores versus co (which is the mean in- or out- 
degree of a vertex in a network). 

... = P{l,m,t^) = 0. Our numerical results data for 
different networks show that P(1,0) = P(0, 1) is the 
last probability to become zero, i.e., the leaves with 
Qin = l,gout = and qm = O^Qout = 1 are the last 
to disappear. Hence, to find the time tj^ at which the 
pruning process will be completed, we should focus on 
the evolution of P(l, 0, t) = P(0, 1, t). After we find t*„^, 
the size and the number of edges of the m-core can be 
obtained from the following relations: 

iV™e = ^[l-P(0,0,t:j], (9) 

L,nc = (10) 

At first, let us study removing of all possible leaves 
from a directed network, which produces the main core of 
the network (or simply a core). Note that because in our 
leaf removal algorithm, in- and out-leaves are chosen with 
equal probability, we can consider only symmetric in, out- 
degree distributions satisfying P{qin,qout) = P{qout,qin)- 
We study networks in which in- and out-degrees of ver- 
tices are uncorrelated, i.e., P{qin,qout) = P{qin)P{qout)- 
We apply the algorithm to a few directed uncorrelated 
networks with various degree distributions, namely, (1) 
a directed Erdos-Renyi graph in which both P{qin) and 
P[qout) are Poissonian, i.e., P{q) ~ e^'^fcp/g!, (2) scale- 
free networks with different values of the degree distribu- 
tion exponent, in which both P{qin) and P{qout) are dis- 
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tributedasP(5) = [-(2z^]7-ir(g-7+l, i^)/r(g+ 
1) = q~'^ , and (3) a directed network for which both 
P{(lin) and P{qout) follow the same exponential law. 
In the leaf removal algorithm, which we perform, in- 
and out-leaves are chosen with equal probability, and 
so the evolving in, out-degree distribution remain sym- 
metric during the entire removal process. The results, 
namely, the relative sizes ric, and the normalized num- 
bers of edges, Ic, of the cores in these networks are shown 
in Fig.[SJ For the directed networks, cq denotes the mean 
in-degree, which is equal to the mean out-degree (half of 
the total mean degree). Comparing Figs. [H and [5] one 
can see that while the critical values of Cq in undirected 
and directed networks coincide, the dependencies nc(co) 
and ?c(co) in undirected and directed networks are essen- 
tially different from each other. This difference indicates 
that the problem of cores of directed networks cannot 
be reduced to that for undirected nets. In Ref. [ij], the 
core was found from numerical simulations of the directed 
version of the Gilbert {Gn,p) random graphs of 10^ ver- 
tices, which are equivalent to the corresponding Erdos- 
Renyi random graph in the infinite network size limit, 
and for directed scale-free networks (static model) with 
the same degree distributions as ours. We compared the 
dependence nc(co) which we obtained for the directed 
Erdos-Renyi graph and for the directed scale-free net- 
work with 7 = 4, see Fig. [5l[a), with those in Figs. S9(b) 
and (d) from the Supplementary material of Ref. jT3| and 
observed a close agreement. 

Finally we study the leave removal process involving 
only leaves with in-degree or out-degree less than or equal 
to m, which generates m-cores. There is some maxi- 
mum value of m at which the corresponding m-core co- 
incides with the main core. This number should depend 
on Cq. To demonstrate the m-core decomposition we use 
a directed Erdos-Renyi graph, for which the number of 
equations in Eq. ([7]) can be truncated to a small num- 
ber due to the rapidly decaying Poisson degree distribu- 
tion. We study the range of co between and 5, choosing 
qin,max = qout,max = 20, and solvc numerically 21^ = 441 
rate equations for each m fixing the parameter cq. Fig- 
ure ini shows the relative size and the normalized number 
of edges for each of m-cores. The most right curves in 
this figure show nc(co) and Ic{cq) for the main core. One 
can see that in the range e < cq < 3.0, the 5-core co- 
incides with the main core. Note that each of m-cores 
emerges continuously. These phase transitions are of the 



second order, and, in the critical region, the m-core's size 
is proportional to the deviation of cq from a critical point. 

IV. CONCLUSIONS 

In the present paper we applied the rate equation ap- 
proach to the problem of cores in directed networks. The 
directedness of edges in these networks results in a wide 
diversity of leaves, which makes this problem essentially 
more complex than for undirected networks. We devel- 
oped the leave removal algorithm involving specific leaves 
and defined the resulting set of m-cores forming an onion- 
like structure. Studying the evolution of the degree dis- 
tribution of various networks during this algorithm, we 
obtained the basic characteristics of all these cores and 
the critical points. We suggest that the equations, which 
we derived for uncorrelated networks, are asymptotically 
exact. We checked that the result of the application of 
this approach to the undirected Erdos-Renyi graph com- 
pletely agrees with the exact result obtained by the gener- 
ating function technique. We also found that our results 
for directed networks are in a close agreement with those 
obtained in simulations. 

For the numerical solution of the rate equations we 
used standard programs for nonlinear differential equa- 
tions, and, for a given finite gmaxj our results, in principle, 
could be obtained with any desired precision. We did not 
however study scale-free networks with degree distribu- 
tions exponent 7 smaller than, approximately, 3 because 
of the too large number of the rate equations which we 
had to solve numerically. The case of 7 < 3 is a challenge 
for future work. 

In summary, we have found an effective way to find 
the complex structure of cores in directed networks. We 
suggest that, similarly to cores in undirected graphs, the 
understanding of cores in directed networks and the pro- 
posed m-core decomposition will shed light on a wide 
range of phenomena in networks. 
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